An Exploration of Graph Based Approaches in Data Stream Mining

نویسندگان

  • D. Aarthi
  • P. Pavithra
چکیده

Progressive advancements in the data stream mining have paved way for many algorithms to solve the problem of concept drift, a serious problem due to the wavering nature of the real time concepts. Concepts tend to change with time therefore concept drift is unavoidable in data stream mining but efficient algorithms can be designed to detect the concept drift and solve the problem. This paper presents a comparison based frame work of five algorithms that are designed to elucidate the drift. The five algorithms presented are OLIN (online information network), VFDT (very fast decision tree), CVFDT (concept adapting very fast decision tree), UFFT (ultra fast forest trees), and CBDT (concept based decision tree). All the above said frame works are based on sliding window algorithm and decision trees. Each time when the concept changes both the old and new set of concepts or data streams are compared to check their rigidity, the more rigid one continues to exist and the other is eradiated. To determine this disparity, decision trees are used as models. Construction of the trees and ability to grow the tree is the distinguishing factor of all the algorithms. The aim of the paper is to effectively discuss the algorithms and suggest the best out of the list, comparing their efficiencies and performances depending on the construction of the tree, its size and various other factors. Keywords-Data stream mining; Decision tree; Sliding window; Training model; Fuzzy network; Linear separability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Ti- anomaly in stream sediment geochemistry using of stepwise factor analysis and multifractal model in Delijan district, Iran

In this study, 115 samples taken from the stream sediments were analyzed for concentrations of As, Co, Cr, Cu, Ni, Pb, W, Zn, Au, Ba, Fe, Mn, Sr, Ti, U, V and Zr. In order to outline mineralization-derived stream sediments, various mapping techniques including fuzzy factor score, geochemical halos and fractal model were used. Based on these models, concentrations of Co, Cr, Ni, Zn, Ba, Fe, Mn, ...

متن کامل

Identification of Geochemical Anomalies Using Fractal and LOLIMOT Neuro-Fuzzy modeling in Mial Area, Central Iran

The Urumieh-Dokhtar Magmatic Arc (UDMA) is recognized as an important porphyry, disseminated, vein-type and polymetallic mineralization arc. The aim of this study is to identify and subsequently determine geochemical anomalies for exploration of Pb, Zn and Cu mineralization in Mial district situated in UDMA. Factor analysis, Concentration-Number (C-N) fractal model and Local Linear Model Tree (...

متن کامل

CRFA-CRBM: a hybrid technique for anomaly recognition in regional geochemical exploration; case study: Dehsalm area, east of Iran

Identification of geochemical anomalies is a significant step during regional geochemical exploration. In this matter, new techniques have been developed based on deep learning networks. These simple-structure-networks act like our brains on processing the data by simulating deep layers of thinking. In this paper, a hybrid compositional-deep learning technique was applied to identify the anomal...

متن کامل

Application of continuous restricted Boltzmann machine to detect multivariate anomalies from stream sediment geochemical data, Korit, East of Iran

Anomaly separation using stream sediment geochemical data has an essential role in regional exploration. Many different techniques have been proposed to distinguish anomalous from study area. In this research, a continuous restricted Boltzmann machine (CRBM), which is a generative stochastic artificial neural network, was used to recognize the mineral potential area in Korit 1:100000 sheet, loc...

متن کامل

An Optimum Selection of Simulated Geological Models by Multi-Point Geostatistics and Multi-Criteria Decision-Making Approaches; a Case Study in Sungun Porphyry-Cu deposit, Iran

An accurate modeling of sophisticated geological units has a substantial impact on designing a mine extraction plan. Geostatistical simulation approaches, via defining a variogram model or incorporating a training image (TI), can tackle the construction of various geological units when a sparse pattern of drilling is available. The variogram-based techniques (derived from two-point geostatistic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013